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Abstract. Traditionally, the major objective in phase I trials is to iden- 
tify a working-dose for subsequent studies, whereas the major endpoint 
in phase II and III trials is treatment efficacy The dose sought is typ- 
ically referred to as the maximum tolerated dose (MTD). Several sta- 
tistical methodologies have been proposed to select the MTD in cancer 
phase I trials. In this manuscript, we focus on a Bayesian adaptive 
design, known as escalation with overdose control (EWOC). Several 
aspects of this design are discussed, including large sample properties 
of the sequence of doses selected in the trial, choice of prior distri- 
butions, and use of covariates. The methodology is exemplified with 
real-life examples of cancer phase I trials. In particular, we show in the 
recently completed ABR-217620 (naptumomab estafenatox) trial that 
omitting an important predictor of toxicity when dose assignments to 
cancer patients are determined results in a high percent of patients 
experiencing severe side effects and a significant proportion treated at 
sub-optimal doses. 

Key words and phrases: Cancer phase I trials, dose-limiting toxicity, 
escalation with overdose control, tolerated dose, optimal Bayesian fea- 
sible. 



1. INTRODUCTION 

The main objective in cancer phase I clinical tri- 
als is to identify a tolerable dose of a cytotoxic or 
therapeutic agent for subsequent studies. Phase I 
trials represent the first testing of an investigational 
agent or combination of agents whose safety profile 
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has been established individually. These trials typ- 
ically enroll patients with advanced cancer stages 
and who have exhausted available standard treat- 
ment options [24]. 

Cancer phase I trials are carried out sequentially, 
assigning dose levels to subjects based on the ob- 
served side effects of the previously treated patients. 
From a safety and therapeutic perspective, these tri- 
als should be designed to minimize the number of 
unacceptable toxic events and maximize the num- 
ber of patients treated at an optimal dose. Ideally, 
the design should control the probability of over- 
dosing patients at each stage of the trial, produce 
a sequence of doses that converge to the MTD, and 
should take into account the heterogeneous nature 
of cancer phase I trial patients [29] . 

Decisions to escalate or de-escalate dose levels in 
cancer phase I trials are made after one cycle of 
therapy to patients. The length of a cycle is usually 
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between 3 and 6 weeks. Therefore, the target phase 
I dose is typically defined in terms of treatment- 
related side effects, ignoring treatment efficacy. This 
is due to the fact that treatment efficacy, expressed 
as a reduction in tumor size or an increase in sur- 
vival, requires months (if not years) of observation 
[21, 34], a length of time far greater than the length 
of one cycle of therapy. Thus, it can be stated that 
the main objective of a cancer phase I clinical trial 
is to determine a safe dose of a new drug or com- 
bination of drugs for subsequent clinical evaluation 
of efficacy. This dose is known as the maximum tol- 
erated dose (MTD), or phase II dose. Specifically, 
the MTD, 7, is defined as the dose expected to pro- 
duce some degree of medically unacceptable, dose- 
limiting toxicity (DLT) in a prespecified proportion 
9 of patients [14], 

(1) P(DLT|Dose = 7) = 9. 

The target probability of DLT 9 depends on the 
severity of the treatment-attributable toxicity. It is 
set relatively high when the DLT is reversible or 
nonfatal condition, and low if it is life-threatening 
[3] . Ting [33] and Rosenberger and Haines [27] gave 
good reviews of statistical methods for dose find- 
ing in cancer phase I trials. In particular, the widely 
used continual reassessment method (CRM) proposed 
by O'Quigley, Pepe and Fisher [21] and its exten- 
sions by Faries [12], Goodman, Zahurak and Pi- 
ant adosi [15], Moller [19], Piantadosi, Fisher and 
Grossman [23], and Storer [28], and the escalation 
with overdose control (EWOC) method proposed by 
Babb, Rogatko and Zacks [3], Zacks, Rogatko and 
Babb [36], Babb and Rogatko [4], Tighiouart, Ro- 
gatko and Babb [31] and Rogatko et al. [25] are 
Bayesian adaptive and produce consistent sequences 
of doses under some model assumptions and regu- 
larity conditions. These designs can be easily imple- 
mented in practice using published tutorials and free 
interactive software; see, for example, the works of 
Garrett [13], Zohar et al. [38], Xu, Tighiouart and 
Rogatko [35], and Rogatko, Tighiouart and Xu [26]. 

In this article, we review several aspects of EWOC, 
including large sample properties, choice of prior dis- 
tributions, and use of covariates. The methodology 
is exemplified with cancer phase I clinical trials we 
designed and conducted at Fox Chase Cancer Center 
in Philadelphia and the Winship Cancer Institute in 
Atlanta. 

This article is organized as follows. In Section 2, 
we introduce the phase I design known as EWOC 



and review its large sample properties. We illustrate 
its implementation using a real-life example. An ex- 
tension of this design to account for patients' spe- 
cific characteristics is described in Section 3 and the 
methodology is illustrated by a recently completed 
phase I cancer trial. Section 4 contains some con- 
cluding remarks and discussion. 

2. ESCALATION WITH OVERDOSE 
CONTROL 

Denote by Y the binary indicator of DLT for a 
patient given dose x. Assume that there exist x* 
and x** , x* < x** such that 

(2) p(Y = l\x = x*) = 0, 

(3) p(Y = l\x = x**) = l-e, 

where < e < 1 is known and 9 < 1 — e. 

Let F{z) be a strictly increasing cumulative dis- 
tribution function (c.d.f.) having probability density 
function f{z). We consider a dose-toxicity relation- 
ship of the form 

P(Y = l\x) 

(4) 

where (3 is unknown, and < (3* < (3 < (3** for some 
positive real numbers f3* and 13**. This model as- 
sumes that the quantiles of F are linear in the log- 
standardized dose z = log [(a; — x*)/(x** — x*)]. An 
example of F that is commonly used in practice is 
the logistic model F(z) = e z /(l + e z ). It is easy to 
verify that model (4) satisfies the constraints (2) 
and (3). The condition (3 > implies that the prob- 
ability of DLT is an increasing function of dose. Let 
<p = i ? ~ 1 (l — e) — F' 1 (9). Using (4), it can be shown 
that the MTD 7 defined in (1) is 

(5) j = x * + ( x ** - x *) e -<t>/P. 

This also shows that 7 £ [x*,x**]. Let 7' = log((7 — 
x*)/(x** -x*)) be the MTD on the log-standardized 
scale. Then (5) implies that 7' = —<p/(3. 

2.1 Dose Escalation Based on Bayesian 
Estimates 

Let G(u) = F(F- 1 (9) + + u),g(u) = G'{u) and 
z\ = —(f)/ (3* be the level assigned to the first patient. 
Then, 

G((3 Z1 ) = F(F~ 1 (9) + F~ x (l - e) - F^{9) + (3 Zl ) 
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= F(F~ l (l-e) + p Zl ) 

= P(Y = 1\ Z1 ) < F(F-\l -e) + P* Zl ) 

= F(F-\e)) = e, 

since z\ < and F(z) is strictly increasing. This 
shows that this log-standardized dose z\ is safe in 
the sense that the probability of DLT at this level 
does not exceed 6. Let D n = {(zi,Yi),i = 1, . . . , n} 
be the data after enrolling n patients to the trial 
where Yi is the observed DLT status of the patient 
getting level Zi, Zi G L* = \-p, —$*]■ 

Let be a prior density function for the pa- 
rameter p on [f3*,f3**] and U n (/3) = U(/3\D n ) the 
posterior c.d.f. given the data D n . Let < a < 1. 
A sequence of dose levels z n such that 



(6) 



P(Zn< 



-MP\Dn- 



> 1 



a 



for all n > 2 is called Bayesian- feasible at level (1 — 
a); see the article by Zacks, Rogatko and Babb [36]. 
Let 



(7) 







,n> 2. 



Then, it is easy to verify that for all n > 2, z n °^ 
is Bayesian-feasible at level (1 — a). The choice of 
Zn as the log-standardized dose levels in the trial 
implies that the posterior probability of exceeding 
the MTD is equal to the feasibility bound a. Let 
T n = a(D n ) be the sigma-field generated by D n and 
ift( a ' be the class of all Bayesian-feasible sequences 
z n G ?n of level (1 — a). 

Definition 2.1. A sequence of levels {z*,n > 
1} G is called optimal Bayesian-feasible at level 
(1-a), if for all iV>l, 



N 



N 



} 



n=l 



inf JT E hih'-Zn) + }, 



n=l 



where z + = zl(z > 0) denotes the positive part of a 
random variable. 

This means that minimizes the average amount 
by which patients are underdosed. Using the law 
of total expectation, Zacks, Rogatko and Babb [36] 
showed that z n °^ is optimal Bayesian-feasible. Con- 
ditions under which this sequence converges to the 
true MTD in probability are stated in the next the- 
orem. 

Theorem 2.1. Suppose that for /3 6 [/?*, /?**]. ■ 



1. < e x < G(-M/n < G(-Po<j>/P**) < 1 " £• 

2. < e 2 < inf{5(/3 x) : x € L*} < sup{g(/3 x) : x G 
L*}<g*. 

3. g(x) is continuously differentiable. 

4. -co < m£{g , (f3 x) :x G L*} < sup{g' (f3 x) : x G 
L*}<oo. 

5. h(P) is uniform on [/?*,/?**]. Then, z n °^ —(f)/ 
f3o as oo. 

Proof. See the article by Zacks, Rogatko and 
Babb [36]. □ 

2.2 Coherence of EWOC 

Coherence of adaptive designs was introduced by 
Cheung [7] in the context of cancer phase I clinical 
trials. Due to ethical concerns, the dose of a cyto- 
toxic agent for the next patient in a trial should 
not be higher than the current allocated dose if the 
current patient exhibits DLT. Likewise, the dose for 
the next patient should not be lower than the cur- 
rent one if the current patient does not exhibit DLT. 
This desirable property is known as coherence and 
Cheung [7] showed that CRM is coherent. The au- 
thor also showed how the coherence property can 
be lost when ad hoc modifications are introduced 
to CRM. In this section, we show that EWOC as 
described in Section 2.2 is coherent. 

Let F{x, 7) = P(Y = l\x) be the model given in 
(4) reparameterized in terms of the MTD 7. Let 
D n = {(x\,Yi), . . . ,(x n ,Y n )} be the data generated 
using the EWOC scheme described in Section 2.2. 
This design is said to be coherent in escalation if for 
all n>2,x n > x n _i whenever Y n _\ = 0. The design 
is said to be coherent in de-escalation if for all n > 
2,x n < x n -\ whenever Y n -\ = 1. The design is said 
to be coherent if it is coherent in both escalation 
and de-escalation. 

Theorem 2.2. Suppose that F(x, r y) is nonin- 
creasing in gamma for fixed dose x. Then the EWOC 
scheme described in Section 2.2 is coherent. 

The proof of Theorem 2.2 is given in the Appendix. 
It is easy to verify that the monotonicity condition 
on F(x,j) is satisfied by model (4), and in particu- 
lar, the logistic function. 

2.3 Two- Para meter Logistic Model 

Denote by X m i n and X max the minimum and max- 
imum dose levels available for use in the trial. One 
chooses these levels in the belief that AT m i n is safe 
when administered to humans. Babb, Rogatko and 
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Zacks [3] considered a two-parameter logistic model 
for the dose-toxicity relationship: 



(8) P(Y = l\Dose 



exp(/3 + fax) 
1 + exp(/3 + fax) ' 



where we assume that fa > so that the probability 
of DLT is a monotonic increasing function of dose. 
Model (8) is reparameterized in terms of the MTD 
7 and the probability of DLT at the starting dose 
po, parameters clinicians can easily interpret. This 
might be advantageous since 7 is the parameter of 
interest and one often conducts preliminary studies 
at or near the starting dose so that one can select a 
meaningful informative prior for pq. Using the def- 
inition of the MTD in (1) and (8), it can be shown 
that 



(9) 



/3b 



fa 



X min logit(fl) - 7logit(p ) 

^min T 

logit(p o )-logit(0) 



The second equation in (9) shows that the as- 
sumption that fa > implies < po < 0. 

2.3.1 Trial design After specifying a prior distri- 
bution h(po,j) for (po,?), denote by n n (7) the 
marginal posterior c.d.f. of 7 given D n . EWOC can 
be described as follows. The first patient receives 
the dose x\ = X m { n and conditional on the event 
{yi = 0}> the (n + l)st patient receives the dose 
x n+ \ = II~ 1 (q) so that the posterior probability of 
exceeding the MTD is equal to the feasibility bound 
a. If y% = 1, we recommend that the clinician stops 
the trial. Calculation of the marginal posterior dis- 
tribution of 7 is performed using numerical integra- 
tion; see [3]. Often in practice, phase I clinical trials 
are typically based on a small number of prespeci- 
fied dose levels d±,...,d r . In this case, the (n + l)st 
patient receives the dose 



d n+ \ = max {di : di 

Ki<r 



X n +1 < Ti 



and U n (x n+1 ) - a < T 2 }, 

where T\ , T2 are nonnegative numbers we refer to as 
tolerances. We note that this design scheme does not 
require that we know all patient responses before we 
can treat a newly accrued patient. Instead, we can 
select the dose for the new patient on the basis of 
the data currently available. At the conclusion of 
the trial, the MTD is estimated by minimizing the 



la{x,y) 



posterior expected loss with respect to some suit- 
able loss function I. One should consider asymmet- 
ric loss functions since underestimation and overes- 
timation have very different consequences. Indeed, 
the dose x n selected by EWOC for the nth patient 
corresponds to the estimate of 7 having minimal risk 
with respect to the asymmetric loss function 

a(j-x), 

if x < 7, that is, if x is an underdose, 
(l-a)(x-7), 

if x > 7, that is, if x is an overdose. 

Note that the loss function l a implies that for any 
S > 0, the loss incurred by treating a patient at 5 
units above the MTD is (1 — a) /a times greater 
than the loss associated with treating the patient at 
5 units below the MTD. This interpretation might 
provide a meaningful basis for the selection of the 
feasibility bound. The above methodology can be 
implemented using the user-friendly software of Ro- 
gatko, Tighiouart and Xu [26]. 

2.3.2 Correlated priors on po and 7 In models (4) 
and (8), we assumed that the support of the MTD 
was strictly contained in and [X m \ a , A" max ] , 

respectively. The assumption that 7 is bounded from 
above may be too restrictive. In the absence of toxi- 
city, this assumption causes the dose escalation rate 
to slow down and in general, the target MTD will 
never be achieved if it lies outside the support of 7. 
Furthermore, since the support of the probability of 
DLT at the initial dose po is [0, 9] and 7 is a function 
of 9, the assumption of prior independence between 
po and 7 may not be realistic. Intuitively, the closer 
po is to 9, the closer the MTD is to A m ; n . Tighiouart, 
Rogatko and Babb [31] introduced a class of corre- 
lated priors for h(po,~f) on [0,0] x [X m i n ,oo) using 
truncated normal distributions for the parameter 7. 
They showed that a candidate joint prior for (/?()> 7) 
with negative a priori correlation structure results 
in a safer trial than the one that assumes indepen- 
dent priors for these two parameters while keeping 
the efficiency of the estimate of the MTD essentially 
unchanged. 

2.4 EWOC with Varying Feasibility Bound 

Many of the phase I cancer trials the authors de- 
signed at Fox Chase Cancer Center and Winship 
Cancer Institute used a variable feasibility bound 
a; see the work of Babb and Rogatko [2, 4], Cheng 
et al. [6], Tighiouart and Rogatko [29, 30], and Xu, 
Tighiouart and Rogatko [35]. The rationale behind 
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this approach is that uncertainty about the MTD 
is high at the onset of the trial and a small value 
of a offers protection against the possibility of ad- 
ministering dose levels much greater than the MTD. 
As the trial progresses, uncertainty about the MTD 
declines and the likelihood of selecting a dose level 
significantly above the MTD becomes significantly 
smaller. However, design operating characteristics 
were not studied. Chu, Lin and Shih [9] compared 
the performance of different versions of CRM with 
EWOC with both constant and varying a. The de- 
sign of EWOC with varying a was termed "hybrid 
design." The authors conducted extensive simula- 
tions to compare these designs in terms of (1) the 
proportion of patients given doses above the "true" 
MTD and (2) the proportion of times the recom- 
mended dose is the "true" MTD after each patient 
is enrolled in the trial and his or her DLT status is re- 
solved. It was found in general that both the hybrid 
and CRM designs had better convergence rate than 
EWOC with fixed a and that EWOC with fixed and 
varying feasibility bound a provide a better over- 
dose protection than the CRM designs in the sense 
that fewer patients are given doses above the "true" 
MTD. 

2.5 Example 

EWOC was used to design a phase I clinical trial 
that involved the Rl 15777 drug at Fox Chase Can- 
cer Center in Philadelphia, USA in 1999. R115777 
is a selective nonpeptidomimetic inhibitor of farne- 
syltransferase (FTase), one of several enzymes re- 
sponsible for posttranslational modification that is 
required for the function of p21(ras) and other pro- 
teins. This was a repeated dose, single center trial 
designed to determine the MTD of R115777 in pa- 
tients with advanced incurable cancer. The target 
probability of DLT was set to 9 = 1/3. The dose- 
escalation scheme was designed to determine the 
MTD of Rl 15777 when drug is administered orally 
for 12 hours during 21 days followed by a 7-day 
rest. This constitutes one cycle of therapy. Toxic- 
ity was assessed by the National Cancer Institute 
(NCI) Common Toxicity Criteria [20]. DLT was de- 
termined by week 3 of cycle 1, as defined by Grade 
III nonhematological toxicity (with the exception 
of alopecia or nausea/vomiting) or hematological 
Grade IV toxicity with a possible, probable or likely 
causal relationship to administration of R115777. 
Dosing continued until there was evidence of tumor 



progression or DLT leading to permanent discontin- 
uation. The initial dose judged to be safe by the clin- 
ician for this study was A" m ; n = 60 mg/m 2 and the 
maximum allowable dose was X m3iX =600 mg/m 2 . 
More details about the dosing regimen for this trial 
can be found in the work of Tighiouart and Rogatko 
[29]. Assuming vague priors for po on [0,8] and 7 on 
[60,600], the prior probability density of (pojT) is 

MPo,7) 

= f 1/180, if Go , 7) e [0, 1/3] x [60, 600] , 
\ 0, otherwise. 

Thus, po and 7 are independent a priori, uniformly 
distributed over their corresponding interval. Figure 
1 shows the posterior distributions of the MTD as 
the trial progressed and Figure 2 shows the pos- 
terior density of the MTD after 33 patients have 
been treated. The posterior mode is 323 which cor- 
responds to the 47th percentile of the distribution. 
In this trial, we used a variable feasibility bound a, 
starting with a = 0.3, this value being a compromise 
between the therapeutic aspect of the agent and its 
toxic side effects. As the trial progressed, a increased 
in small increments until a = 0.5 so that, by the end 
of the trial, the given dose corresponds to the 50th 
percentile, that is, the median of the marginal poste- 
rior probability density function. Thus, the dose to 
be given to the 34th patient is 328. The 95% highest 
posterior density interval is [160.5,536.1]. 

3. ADJUSTING FOR PATIENTS' BASELINE 
COVARIATES 

A key assumption implied by the definition of the 
phase I target dose (MTD) is that every subgroup of 
the patient population has the same MTD. That is, 
it is assumed that the patient population is homo- 
geneous in terms of treatment tolerance and every 
patient should be treated at the same dose. As a 
result, no allowance is made for individual patient 
differences in susceptibility to treatment [11]. 

Babb and Rogatko [4] extended EWOC to allow 
the incorporation of information concerning individ- 
ual patient differences in susceptibility to treatment. 
The method adjusts doses according to patient-specific 
characteristics while safeguarding against overdos- 
ing. 

3.1 Model 

Let W be a p-dimensional baseline covariate vec- 
tor. We consider the dose-toxicity model 

P(Y = l\Dose = x,W = w) 
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Fig. 1. Posterior density of the MTD when the 
number of treated patients (from bottom to top) is 
1,5,10,15,20,25,30,33. 
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Fig. 2. Posterior density of the MTD after 33 patients have 
been treated. The posterior mode is 323 (47th percentile) and 
the median and dose to be given to the 34th patient is 328. 
The 95% highest posterior density interval is [160.5,536.1]. 



(10) 



exp(/3 + Pix + rj'w) 



1 + exp(/3 + P\x + rfw) 

where r] € R p is the effect of the baseline covari- 
ate vector on DLT. Let p x (w) = P(Y = 1| Dose = 
x,W = uu). We assume that (3\ > so that p x (w) 
is an increasing function of dose x for fixed w. The 



MTD for a patient with baseline covariate value w 
is defined as the dose "f(w) that results in a proba- 
bility equal to 6 that a DLT will manifest. It follows 
from model (10) that 



(ii) 7 H = /3r 1 



log 



1 



A) 



rj'w 



As in Section 2.3, we reparameterize this model 
in terms of (^(w*),p) for a selected value of the 
baseline covariate vector w = w* and p is a (p + 1)- 
dimensional nuisance parameter. 

3.1.1 Trial design Let h(p, r y(w*)) be a prior dis- 
tribution for (p,j(w*)) and denote by H n ,w* (t(^*)) 
the marginal posterior c.d.f. of 7(10*) given the data 
Ai = { (?i , Yi , wi) , . . . , (x n , Y n , w n ) }. The first patient 
receives the dose x\ = X m \ n and conditional on the 
event {yi = 0}, the (n + l)st patient with covariate 
vector value w n+ i receives the dose x n+ \ = 
H~* w (a) so that the posterior probability of ex- 
ceeding the MTD is equal to the feasibility bound 
a. Note that here, n+1 (-) is the inverse c.d.f. of 

(tKh-i)). 

For a binary covariate W, Tighiouart, Rogatko 
and Xu [32] studied operating characteristics of this 
model with extensive simulations under different sce- 
narios for the underlying true MTDs. They found 
that if the two MTDs are different and the design 
does not adjust for this heterogeneity, then the trial 
will result in more patients being overdosed. If the 
two MTDs are different and parallel trials are used, 
then the estimates of the MTDs are less efficient. Fi- 
nally, if the two MTDs are the same and the design 
adjusts for patients' heterogeneity, then few more 
patients can be overdosed if the true MTD is low 
relative to a design with no covariate but the differ- 
ence is not practically important. Thus, we stand to 
lose little if we do include a statistically nonsignif- 
icant covariate in the model. This conclusion is in 
agreement with the findings of O'Quigley, Shen and 
Gamst [22]. 

3.2 Example 

ABR-217620 (naptumomab estafenatox) is a re- 
combinant fusion protein that consists of the 5T4Fab 
moiety genetically fused to the engineered super- 
antigen variant SEA/E-120. This fusion protein is a 
new generation tumor-targeted superantigen based 
on the previously described ABR-214936 
(anatumomab mafenatox). ABR-217620 was 
designed to reduce antigenicity and toxicity. We use 
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Fig. 3. Dose allocation as a function of baseline Anti 
SEA/E120 during the trial for all 39 patients. The solid line is 
the MTD conditional on the covariate Anti SEA/E120 which 
corresponds to the posterior median of the conditional poste- 
rior distribution of the MTD and the dashed lines delimit the 
95% Bayesian credible region. 

model (10) with W = C representing the Anti 
SEA/E120 covariate to design a phase I study for 
nonsmall cell lung cancer (NSCLC) patients. The 
goal is to determine the MTD of ABR-217620 as a 
function of patients' baseline Anti SEA/E120 and 
test whether the neutralizing effect of Anti SEA/ 
El 20 on the cytotoxic agent which was observed by 
Babb and Rogatko [4] has been reduced or elimi- 
nated with this new agent. The modeling approach 
is similar to the PNU trial described in [4], the tar- 
get probability of DLT 6 was set to 0.2. The fea- 
sibility bound a was set at 0.25 for the first nine 
patients, then was increased to a maximum value 
of 0.5 by increments of 0.05 every time a new pa- 
tient was enrolled in the trial and a DLT assessment 
was resolved. Based on preliminary clinical data, the 
minimum and maximum allowable doses for ABR- 
217620 set by the clinicians are x m \ a = 1 ug/kg and 
x max = 100 ug/kgl. The minimum and maximum 
values of Anti SEA/E120 anticipated in the trial are 
c\ = pmol/ml and c 2 = 200 pmol/ml, respectively. 
As in the PNU trial [4], we reparameterize model 
(10) in terms of 7 max = j(c 2 ),pi = Px mm (ci), p 2 = 
Px mm (c2) with (7 m ax, Pi, P2) uniformly distributed on 
{(x, y,z):ye(0,9],ze (0, y), x € [1, 100]} a priori. 

Figure 3 shows the doses allocated to all 39 pa- 
tients clS 8l function of their pretreatment Anti 



Fig. 4. Dose allocation as a function of baseline Anti 
SEA/E120 during the trial for all 28 NSCLC & PC patients. 
The solid line is the MTD conditional on the covariate Anti 
SEA/E120 which corresponds to the posterior median of the 
conditional posterior distribution of the MTD and the dashed 
lines delimit the 95% Bayesian credible region. 

SEA/E120. The solid line is the estimated condi- 
tional MTD, obtained by taking the posterior me- 
dian of the marginal distribution of the MTD condi- 
tional on the covariate Anti SEA/E120. The dashed 
lines delimit the 95% Bayesian credible region. Six 
patients experience DLT (15.4%) and the MTD seems 
to indicate that the neutralizing effect of Anti 
SEA/E120 has been reduced considerably. The pro- 
tocol was amended to include patients with renal cell 
(RCC) and pancreatic cancer (PC). Figures 4 and 5 
show the doses allocated to NSCLC &: PC patients 
and RCC patients, respectively. The solid line repre- 
sents the conditional MTD obtained after fitting the 
data in each group to model (10), reparameterized 
in terms of (7 max , pi, p 2 ). This shows that NSCLC 
& PC patients were treated at sub-optimal doses 
and RCC patients were overdosed, with 36.4% ex- 
periencing DLT, way above the target probability of 
DLT 9 = 0.2. 

The effects of Anti SEA/E120 and type of cancer 
were tested by fitting model (10) with W = (C,Z), 
where C is the baseline Anti SEA/E120 and Z is a 
binary covariate representing the cancer type, Z = 
z\ = 1 for NSCLC and PC patients and Z = z 2 = 
for RCC patients. To be consistent with the pri- 
ors used to design the trial, we reparameterized the 
model in terms of 7 max = 7(c 2 , zi), pi = p Xmin (ci , z x ) , 
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Anti-SEAJE120(pmol/nil) 

Fig. 5. Dose allocation as a function of baseline Anti 
SEA/E120 during the trial for all 11 RCC patients. The solid 
line is the MTD conditional on the covariate Anti SEA/E120 
which corresponds to the posterior median of the conditional 
posterior distribution of the MTD and the dashed lines delimit 
the 95% Bayesian credible region. 

P2=Px min ( c 2,zi) and P3=p Xmin {ci,z 2 ). Independent 
uniform priors are placed on these parameters. It 
can be shown that this induces priors centered at 
for the Anti SEA/E120 and cancer type effect pa- 
rameters 7/i and 7/2. We used WinBUGS [18] to fit 
this model and the 95% HPD intervals for the pa- 
rameters 7/i and 7/2 were (—0.14, 0.24) and (—4.6, 0.6), 
respectively. We conclude that the agent ABR-2 17620 
was successful in reducing the neutralizing capacity 
of Anti SEA/E120 and that the phase II dose should 
be carefully tailored to account for patients' cancer 
type and hence avoid excessive overdosing and un- 
derdosing patients. 

4. DISCUSSION 

In this article, we described EWOC, a Bayesian 
dose finding design for cancer phase I clinical trials. 
The method is flexible enough to allow prior infor- 
mation about the drug from laboratory or animal 
studies to be incorporated in the model, is coher- 
ent, makes use of all the information available at the 
time of each dose assignment and controls the prob- 
ability of overdosing patients at each stage. EWOC 
can be implemented with the user-friendly software 
EWOC 2.1 [26] or WinBUGS [18] for general class 
of prior distributions [31]. The two-parameter model 



described in Section 2.3 accounts for the uncertainty 
regarding the probability of DLT at the initial dose 
by placing a vague prior distribution on pq. If ex- 
pert opinion about this parameter is available, then 
it should be incorporated in the prior for pq. In par- 
ticular, if the clinician strongly believes that this 
prior can be approximated by a point mass distri- 
bution, then the one-parameter model described in 
Section 2.1 may be used. In any case, design op- 
erating characteristics should be performed with a 
sensitivity analysis about the parameter po when 
designing the trial. Our own experience in design- 
ing dose-finding studies in cancer is that the uncer- 
tainty of the clinicians regarding the probability of 
DLT at the initial dose is large. Thus, in more than 
ten years of designing trials with EWOC, the use 
of a one-parameter model was never chosen by the 
clinical researchers we worked with. 

It is worth highlighting that the values of a and 
6 are chosen independently when the trial is de- 
signed. They have distinct meanings and functions. 
For example, taking a value of a greater than 9 
only affects the loss function used to estimate the 
next dose and the MTD at the conclusion of the 
trial. It does not mean that patients are given doses 
at a rate above the target probability of DLT 9. 
When a = 0.5, the method differs from CRM in the 
sense that the loss functions are different. The loss 
function for EWOC is taken with respect to the pa- 
rameter 7, the MTD. The overprotection property 
of EWOC is with respect to the posterior distribu- 
tion of the MTD, given the data. The overprotection 
property states that the posterior probability of ex- 
ceeding the MTD given the current data is bounded 
by a. This overprotection is as good as the posterior 
distribution of the MTD at each stage of the trial. 
For instance, if we used a flat prior on the MTD 
and the true MTD turns out to be very close to the 
initial dose, then it would take many patients for 
the median of the posterior distribution to cluster 
around the true MTD. 

Another aspect of cancer phase I clinical trials not 
discussed here is the choice of the number of patients 
to enroll. Most sample size recommendations in the 
literature are based on prespecified stopping rules; 
see, for example, the work of Zohar and Chevret 
[37] on selecting the number of patients by consid- 
ering different stopping rules using the CRM. Lin 
and Shih [17] and Ivanova [16] described sample size 
recommendations based on the expected number of 
patients allocated to each dose selected from a set 
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of prespecified dose levels. However, these methods 
apply to a prespecified set of discrete doses and it 
is not clear how they can be applied to continu- 
ous doses. Unlike the frequentist approach, there is 
no consensus on a specific Bayesian method for the 
sample size determination problem; see the article 
by Adcock [1] for a review of Bayesian approaches. 
We conducted extensive simulation studies in order 
to estimate the sample size based on a desired ac- 
curacy of the Bayes estimate on the average. Specif- 
ically, we determined the minimum number of pa- 
tients so that the posterior variance of the MTD on 
the average over all possible trials is no more than 
a specified margin. Tabulated values of the average 
mean posterior standard deviation, length of 90% 
and 95% HPD intervals for different values of the 
target probability of DLT 9 are available from the 
authors upon request. 

The methodology described in this article assumes 
that DLT status is binary and does not account for 
patients' time to toxicity. Information on time to 
DLT is crucial to clinicians in that it permits a dy- 
namic updating of the posterior distribution of the 
MTD based on the number of patients who experi- 
enced DLT and the ones who are still at risk. If new 
patients are eligible to enter the clinical trial while 
the DLT status of currently enrolled patients is still 
being resolved, then the new patients are allocated 
to the current established dose because it is not eth- 
ical to resolve DLT status at the expense of treat- 
ment delay. In this case, there is no adaptation to 
the most current information and the design will not 
be efficient. Time to DLT was first investigated by 
Cheung and Chapell [8] and later adapted to estima- 
tion of a maximum cumulative dose by Braun et al. 
[5] . These methods are extensions of the CRM and 
incorporate information on partially observed pa- 
tients using weighted binomial likelihoods. EWOC 
can be adapted to this framework by modeling time 
to DLT Cox [10] type model with a paramet- 
ric or nonparametric baseline risk of toxicity ho(t). 
We are currently investigating the performance of 
a large class of models within this framework via 
extensive simulations. 

APPENDIX 

Proof of Theorem 2.2. Let Il n (t) be the pos- 
terior c.d.f. of 7 given D n ,n > 2. Then, it suffices to 
show that: 

1. ~n. n (t) < H n -i(t) for all t whenever Y n = 0. 



2. H n (t) > IT„_i(i) for all t whenever Y n = 1. 
Let 

n 

Ln(7l^n)=n( F (^^)) Yi ( 1 -^^)) 1 ^ 
i=l 

be the likelihood function and h(j) be a proper 
prior distribution for 7. To simplify notation, let 
L n {"l\D n ) = L n ,(7), Fi(j) =F(x i ,'y) and suppose that 
x* = 0, x** = 1. Using Bayes' rule, the posterior c.d.f. 
II n (i) given D n is 

Jjf L n (i)h(i) dry 



n„(t) 



Jo L nh)h(-f) d-f 



Suppose that Y n = 0. Then, L n (^f) = L n _i(7)(l 
F n (j)) and 

n , t) = / *Ln-i(7)(l- ^(7)^(7) <*7 
£ W7)(l -FM)h( 7 )dj 
It follows that 

n n (t)-n n _!(t) 

j t L n „ 1 { 1 ){l-F n ( 1 ))h{ 1 )d 1 
^L n ^ 1 { 1 ){l-F n { 1 ))h{ 1 )d 1 

_ /q -£n-i(7)h(7) <*7 

Jo 1 L n-i (n)h{i) d i 



A~ 



t rl 



J 



L n „ 1 ( 7 )L n „ 1 ( 7 / )/ i (7)/i(7 / ) 



x [F n {i) - F n (rf)]di d 1 



where 



1 /•! 



A= L n _!(7)(l-Fn(7))%) 

Jo Jo 

L n ^ l { 1 ')h{i)d 1 'd 1 , 

n re (t)-n n _i(t) 

= A^ /7^n-i(7)£n-i(7>(7)^(70 



J 



x [F n {i)-F n { 1 ))did 1 
+ A- 1 J*£ L n _ 1 ( 7 )L re _ 1 (7 / )/ l ( 7 )/i(7 / ) 



t /•! 



A~ 



x [ J F ra (7 / )-F n (7)]d7 / d 7 

L n _x (7)^-1(70^(7)^(7') 
x [ J F n (7 , )-F„(7)]d7 , d 7 <0, 
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since F n {^f) is nonincreasing in 7. Hence, H n (t) < 
II n _i(t), which implies that U~ 1 (a) > II~_ 1 (q;), that 
is, x n +i > x n . Using a similar argument, one can 
show that II~ 1 (a) < 11"^ (a) if Y n = 1. This shows 
that EWOC is coherent. □ 
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